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Abstract 

Epilepsy or seizure disorder is one of the most common neurological disorders. Epilepsy is characterized by unforeseeable 
seizures. The cause of these seizures is often totally unknown. Epileptic discharges normally appear on the Electroencephalogram 
(EEG) signal which represents the brain’s electrical activities. EEG signal analysis is the standard approach used in the detection 
and prediction of epileptic seizures. A novel automated system for the identification of epilepsy patients and detection of seizures 
is proposed in this study. Statistical moments are used to reduce the dimension of the input EEG signal and to obtain distinctive 
features from it which are subsequently fed to a modified Support Vector Machine (SVM) algorithm for classification (epileptic 
or not epileptic). Experimental tests show that the standard deviation and mean values of the input EEG signal form robust 
features. Testing the performance of the proposed system on a publicly available epilepsy dataset provided by the University of 
Bonn, achieved 100% accuracy. The proposed system requires up to 83% fewer clock cycles than the lift algorithm and 88% 
fewer clock cycles than the convolution-based algorithm. 
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Introduction 

The Epilepsy or seizure disorder is one of the most Electrical activities of the brain can be seen by the 
common neurological disorders [1]. About 2% of the  electroencephalogram (EEG) [10,11]. In addition to 
world’s population are affected by epilepsy [2]. its various applications in the medical fields, the EEG 
Epilepsy is characterized by unforeseeable seizures. 1s considered the most useful and significant test for 
The cause of these seizures may be related to a checking if someone has epilepsy [12]. . 
family trend or a brain injury but is often totally Recording of the EEG signal can be done either 
unknown [3]. If a person has one or more seizures, invasively or non-invasively [13]. In the non- 
then that person is diagnosed with epilepsy unless the invasive method, the EEG is directly recorded from 
seizures are caused by some known medical the scalp. Here multiple-channel EEG signals are 
conditions [4]. In other words, if a person has a recorded simultaneously with electrodes placed on 
seizure, it does not necessarily mean that he or she the scalp [14]. A gel is often applied in order to 
has epilepsy. Non-epileptic seizures may happen decrease the electrical resistance between the 
because of several reasons including brain tumors, electrodes and the skin. Usually, 19 electrodes in 
stroke, head injury, and birth defects [5]. addition to system reference and a ground are 
The electrical events that produce the symptoms of a attached to the scalp and arranged in a specific order 
seizure occur in the brain. Specifically, an excessive following the International 10-20 system [15]. 
neuronal discharge and an unanticipated electrical Figure 1 depicts the electrode locations of the 
disturbance of the brain cause the seizures [6]. The International 10-20 system for EEG recording. 
unpredictable nature of seizures will have a Fewer electrodes are used when recording the EEG 
tremendous impact on the patient’s quality of life Signal for a neonate. . 

[7,8]. Consequently, early detection of pre-seizure is The other type of EEG recording known as 
a very important demand as it may allow the patient Intracranial Electroencephalography (EEG), is 


to take appropriate and in some cases life-saving invasive and is often captured during a surgery [17]. 
precautions [9]. Here, electrodes are implanted on the exposed 


surface of the brain to record the EEG signal directly 
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from the cerebral cortex. Most of the research work 
in the field of seizure analysis is based on the scalp 
EEG recording method. 
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Figure 1: Electrode locations of International 10-20 


system for EEG recording [16]. 







In the seizure analysis problem of epilepsy patients, 
the EEG signal is studied for the purpose of 
classifying the seizure and for the purpose of 
predicting epileptic activities before they occur. 
Visual inspection of EEG signals is a time- 
consuming process and is subject to a human 
interpretation which may lead to incorrect diagnosis 
due to various human-related factors such as fatigue. 
In this study, we propose a novel automated epilepsy 
detection system. Specifically, we use statistical 
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moments to infer discriminatory information about 
the input EEG signal which hopefully forms a valid 
representative feature vector. The input feature 
vector is then passed to Support Vector Machines 
(SVMs) for classification and labeling (epileptic or 
not epileptic). Testing the proposed system on an 
epilepsy dataset, obtained from the University of 
Bonn [18], achieved a 100% success rate. To prove 
the validity and robustness of the proposed scheme, a 
review of the accuracy rates of various methods 
employed in the literature is provided in this paper. 
The proposed system is proved through mathematical 
analysis to have a very low time complexity 
compared to the state-of-the-art methods in epilepsy 
detection. 


Materials and Methods 

The raw EEG dataset used in this study is obtained 
from Bonn University and is publicly available for 
free download [19]. The entire database is composed 
of five sets, each of which contains 100 single- 
channel surface EEG signals of 23.6s. In our dataset, 
we adopted one healthy set (seizure-free recorded 
extracranially with eyes open) and one set containing 
ictal or seizure activity, recorded intracranially from 
a volunteer patient. Hence our dataset is composed 
of 200 samples. Figure 2 shows samples from the 
used dataset of an epileptic and a seizure free 
recordings. 
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Figure 2: Examples of raw EEG recordings for (A): seizure-free and (B): seizure phases. 
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The x-axis and the y-axis in Figure 2 represent the 
time period [0 to 23.6s] and the EEG signal value in 
microvolts, respectively. The raw EEG signals were 
recorded using a 128-channel amplifier system, band- 
pass filtered using a band-pass of [0.53-40] Hz, and 
then sampled at a rate of 173.61 Hz. A block diagram 
illustrating the stages of the proposed system is 
depicted in Figure 3. The statistical features (mean 
and standard deviation) extracted from the input EEG 
signal are fed directly to an SVM classifier. 
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Figure 3: Block diagram of the proposed system. 


SVMs have supervised learning algorithms that are 
commonly used for classification and regression 
applications. An SVM, devised by Cortes et al. [20] 
is a wo-group Classifier that has been used in recent 
years to efficiently solve linear and non-linear 
classification problems [21]. Although SVMs can be 
modified to handle multiclass problems [22], they 
were originally designed to classify data composed of 
exactly two classes. In our application, SVM is used 
to classify the EEG signal into either healthy or 
epileptic. As depicted in Figure 4, an SVM classifies 
data by determining the best hyperplane that isolates 
the data points of the two classes. 





Figure 4: Support Vector Machine 


In other words, an SVM tries to find the widest 

possible margin that separates the two classes and has 

no interior data points. The SVM algorithm 

implemented here uses the Gaussian kernel defined 
IIx — yll? 


by: 
k(x, y) = exp (- 202 j 


Note: o is a user-defined variance parameter. 


(1) 
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Statistical analysis 

As illustrated in Figure 2, the normal and seizure 
EEG samples exhibit different statistical 
characteristics. For example, the mean and standard 
deviation of the normal (healthy) sample is negative 
28.4 and 42.1, respectively whereas the mean and 
standard deviation of the epileptic sample is 3.3 and 
259.9, respectively. This acute variation in statistical 
properties is a prominent factor in the motivation to 
employ statistical moments to obtain distinctive 
features from the input EEG signal. 

In this study, we explore the use of moments as valid 
features representing the input EEG signals. 
Moments are statistical measures that describe signal 
characteristics [23]. The first two moments in 
statistics are the mean and the variance which is the 
square of the standard deviation. These two moments 
are by far the most commonly used moments. The 
third and fourth moments are the skewness and the 
kurtosis, respectively. Higher-order moments (above 
the 4th moment) are not easily described or estimated 
[24]. Moments have found several practical 
applications in the field of digital signal processing 
(DSP) including pattern recognition, image encoding, 
and pose estimation. Sarhan et al. have used the 
standard deviation measure in the Arabic character 
recognition application [25]. Amr et al. have 
employed the moments in an image retrieval 
application [26]. Boveiri has discussed the use of 
statistical moments in pattern classifications [27]. 
Teh et al. have applied the moments in image 
analysis [28]. 


Let the input EEG signal x be defined as a discrete- 
time (DT) sequence of real numbers, such that 

» Goel, 4 Ee. Oe. & Sane ene (2) 
The first moment or the sample mean is widely used 
as a measure of central tendency and is affected by 
every sample in the signal. The mean is given by the 
following expression: 


À 
X = — 


Á (3) 
The Sample Variance measures the variability and is 
given by 


E pa —x) 


— m=] 
The Sample Variance measures the variability and is 
given by (4) 
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The sample Standard Deviation is defined as the 
square root of the variance or 
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The Sample Standard Deviation is also given by the 
following expression 








(5) 


Results 

As commonly practiced in supervised machine- 
learning models, we used the Holdout Sample 
method for cross-validation. Specifically, the EEG 
data was randomly split into a training set and a 
testing set. When using only the standard deviation 
and the mean of the input samples as features, the 
proposed system produces a zero error rate. In the 
following experiment, we explore the performance of 
the system when the input EEG is contaminated with 
the additive white Gaussian noise of zero mean. 
Depicted in Figure 5 is the error rate of the proposed 
system as a function of the standard deviation of the 
noise. Fig.5 clearly illustrates that the proposed 
system is robust and produces a zero error rate for 
low levels of additive white Gaussian noise. Even 
for high levels of additive noise, the system’s error 
rate is less than 12.5 %. Compared to the state-of-the- 
art methods in epilepsy feature extraction which were 
reviewed in this study, the proposed system is 


superior in terms of the accuracy rate and 
complexities. 
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Figure 5: Error rate of the proposed system as a 
function of the noise standard deviation 
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Discussion 

Epilepsy is characterized by seizures and is highly 
unpredictable. A seizure may be epileptic or non- 
epileptic. EEG signal analysis is considered the 
standard approach used in the detection and 
prediction of epileptic seizures. Manually 
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determining the location of the seizure period in EEG 
signals is a tedious, time consuming, and difficult 
challenge. Consequently, there is a strong need for 
an automatic system for the detection and prediction 
of seizures in EEG recordings. 

Classification of EEG signals or any signal in general 
consists of two major functions: feature extraction 
and class-labeling. The purpose of the feature 
extraction stage is to obtain finite characteristics that 
are representative of the input signal. This process 
often involves compression and redundancy removal. 
In the class-labeling stage, a classifier is used to 
operate on the extracted features of the input signal 
and assign the input to a particular class. 
Classification methods can be categorized into four 
kinds: machine learning techniques such as. Fuzzy 
Network, SVM, and artificial neural network (ANN); 
statistical methods such as Bayesian classification; 
logic-based techniques such as decision trees (DT); 
and instance-based methods such as the k-nearest 
neighbor (KNN) algorithm. 

In the following survey, we aim to explore the 
various feature extraction and classification methods 
that are found in the literature of seizure detection. A 
recent review of signal processing techniques and 
classification methods in seizure analysis was 
performed by Bou Assis et al [29]. Alotaiby et al. 
categorized the EEG analysis methods into time- 
domain and frequency domain methods and provided 
a valuable survey of the EEG seizure detection and 
prediction algorithms [30]. Wei et al. used a time- 
domain method (shape similarity) for feature 
generation and the Hausdorff distance as a classifier 
to recognize epileptic discharges in EEG [31]. Patidar 
et al. worked on the diagnosis of epilepsy by 
extracting features using Kraskov entropy derived 
from Tunable Q-Factor Wavelet Transform (TQWT) 
and evaluated the system performance as a function 
of Q [32]. 

Jaiswal et al. used the Local Neighbor Descriptive 
Pattern (LNDP) and the One-dimensional Local 
Gradient Pattern (1D-LGP) techniques for feature 
extraction and passed the extracted features to an 
ANN classifier [33]. Li et al. used the dual-tree 
complex wavelet transform (DT-CWT) for feature 
extraction and SVM for classification [34]. Ekong et 
al. used a fuzzy SVM in the classification of epilepsy 
seizures [35]. Satapathy et al. used the radial basis 
function neural network (RBFNN) for epilepsy 
classification and the Daubechies wavelet at level 
four for extraction [36]. Riaz et al. employed the 
empirical mode decomposition (EMD) for extracting 


features from EEG signals and used SVM for the 
classification phase [37]. 

A body-senor network (BSN) that used signal 
statistics such the mean, variance, zero-crossing rate, 
and entropy; was developed by Dalton et al. to 
monitor and detect epileptic seizures [38]. The use of 
the Principal Component Analysis (PCA) in the 
Wavelet domain was adopted by Xie et. al. for de- 
noising and feature extraction [39]. A review of 
wavelet techniques for computer-aided seizure 
detection and epilepsy diagnosis was developed by 
Faust et al. [40]. Feature extraction using 
approximate entropy on DWT coefficients was used 
by Ocak [41] and by Guo et al. [42]. Chiu et al. 
extracted features from the EEG signal using Wavelet 
energy [43]. Line length feature was adopted by Guo 
et al. [44]. Subasi et al. extracted features by 
employing the PCA, Independent-Component 
Analysis (ICA) and Linear Discriminant Analysis 
(LDA) on DWT coefficients [45]. 

Next, we demonstrate that the proposed system 
outperforms the methods described in the literature of 
epilepsy analysis in terms of two main merit 
measures, accuracy, and complexity. First, as was 
just discussed in the literature review part of this 
paper, the accuracies of the epilepsy analysis 
methods are lower than the 100% accuracy rate 
achieved by the proposed system. 

Second, we provide a time complexity comparison of 
the feature extraction stage of the proposed system 
and the methods introduced in the literature. There 
are two approaches for evaluating algorithm 
complexities [46]. The first method is based on 
analyzing the written algorithm to count the main 
operations [47]. The second method is based on 
running the algorithm on a PC to measure the time 
and memory costs. Of course, the latter method is 
not widely used as it is platform-dependent and the 
result will vary when the algorithm is run across 
different platforms. 

The calculation cost in the feature extraction stage of 
the proposed system depends only on calculating the 
mean and the standard deviation of the input 
sequence. 

Postulate 1: The time complexity Tm[n] for 
computing the mean function (Eq. 3) is given by: 
Tm[n] = n+1 = O(n) 

Where n is the length of the 1-D input sequence. 
To calculate the time complexity Ts[n] for evaluating 
the standard deviation function, rewrite Eq.4 as: 
paa _ HRY - DAW 5) 


Where the constant 1/(m-1) has been neglected 


(6) 
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Postulate 2: The time complexity Ts[n] for 
computing the variance function (Eq. 4) is given by: 
Ts[n] = n +n +2n = 2n? +2n =O(n’) (7) 
Thus the total time complexity of the proposed 
systems is 

T[n] = Ts[n] + Tm[n] = 2n° +2n + n+] 

T[n] = 2n +3n +1 = O(n’) + O(n)+ O(1) = O(n’) (8) 
To provide a clock cycle cost analysis, we assume 
that the input samples are stored as 16-bit fixed point 
numbers. Therefore, when the algorithms are 
implemented in a DSP microcontroller, the cycles per 
instruction cost are listed in Table 1. 


Table 1: Cycles per operation for a 16-bit fixed point number 


Number of cycles 
subtraction/addition 1 clock cycle 


multiplication by 2 1 clock cycle 
Multiplication 10 clock cycles 





The discrete wavelet transform of an input vector of 
length m returns a vector of length m, the same 
length as the input. It can be deduced from Eq. 8 that 
the proposed system will require the following 
operations to transform an m-sample signal: 


Postulate 3: The number of operations required by 
the proposed system to transform an input sequence 


of length m: 
Total operations= m multiplications + 2m 
multiplication by a constant + 3m additions (9) 


Using Eq. 9 and Table 1, the number of clock cycles 
required by the proposed system is: 

C[m] = 10m +2m +3m cycles. Thus the number of 
cycles required by the proposed system to perform a 
WT of an m-sample input sequence is given by 


C[m] = 15m clock cycles for an input sequence of m 
samples. (10) 

Next, we compare the time complexity of the 
proposed system with the Wavelets complexity as 
Wavelet is the most commonly used algorithm in the 
feature extraction stage of epilepsy analysis and is 
widely considered the state-of-the-art approach in 
this field. First, one disadvantage of using the 
Wavelet transform (WT) to perform compression is 
that it has a higher numerical cost compared to the 
other transforms such as the Discrete Cosine 
transform (DCT) and the Fourier transform (FT) 
[48,49]. WT is normally computed using lift 
algorithms [50, 51] or convolution-based algorithms 
[52]. 


Next, we provide the number of operations 
(additions, subtractions, and multiplications) that are 
needed by the lift and convolution-based algorithms 
to calculate the Daubechies (DB) WT. Here the 
algorithms are used to calculate a single step of DB- 
WT. A single step refers here to the calculation of 


two output samples of a Wavelet transform based on 
two input samples. Table 2 depicts the required 
operations of the lift-based and convolution-based 
algorithms, for evaluating the DB-WT. 

It can be seen from Table 2 that the lift and 
convolution-based algorithms, when transforming an 
m-sample sequence, require the following operations: 


Table 2: Number of operations required to compute the Wavelet Transform 


Algorithm Multiplication Multiplication 2 Subtraction/addition 


Lift-based 





Convolution-based | 12, | | l0 O O 


Lift-based algorithm 
additions 


8m multiplications and 6m 


(11) 


Convolution-based algorithm = 12m multiplications 
and 10m additions (12) 


Using Table 1, the clock cycle costs of the Lift-based 
and Convolution-based algorithms are: 

C[m] = 86 clock cycles Lift-based algorithm (13) 
C[m]= 130 clock cycles convolution-based algorithm 
(13) 

Assuming a 16-bit integer value the proposed system 
requires up to 83% fewer clock cycles than lift 
algorithm and 88% fewer clock cycles than a 
convolution-based algorithm. 


Conclusion 

A novel approach to the detection of epileptic 
seizures in EEG signals is introduced in this study. 
The system is based on extracting statistical features 


from the EEG signal and applying the features to an 
SVM for classification. Experimental tests showed 
that the standard deviation and mean values of the 
input EEG signal form robust features. Simulations 
showed that the proposed system achieved 0% error 
rate. The experiments also revealed that when EEG 
signals are corrupted with a high-level white 
Gaussian noise, the proposed system still achieves a 
small error rate of about 15%. 

To show the validity of the proposed system, the 
performance of other systems proposed in the 
literature of EG analysis is discussed and the 
accuracies of other methods found in the literature 
are listed in Table 3. The main merits of the 
proposed systems are its low complexities and high 
accuracy compared to the state-of-the-art methods 
which were reviewed in this work. Assuming a 16-bit 
integer value the proposed system requires up to 83% 
fewer clock cycles than lift algorithm and 88% fewer 
clock cycles than a convolution-based algorithm. 


Table 3: Accuracies of seizure classification methods proposed in the literature 


Methods 
K-NN classifier and Wavelet features 


K-NN classifier and Genetic Programming 


ANN and PCA 

Expert model and Wavelet transform 
Histograms and SVM 

Gaussian mixture model and Wavelets 
linear least squares models 

wavelet based and statistical features 
PCA and ANN 


phase space representation (PSR) for feature 


extraction and Least squares SVM for 
classification 

LDA and SVM classifier 

Wavelets and nearest neighbor classifier 
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